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Abstract 

Mukherjee (Pattern Recognition Letters, vol. 32, pp. 824-831, 2011) recently introduced a class of distance 
functions called weighted i-cost distances that generalize m-neighbor, octagonal, and i-cost distances. He proved 
that weighted i-cost distances form a family of metrics and derived an approximation for the Euclidean norm 
in Z™. In this note we compare this approximation to two previously proposed Euclidean norm approximations 
and demonstrate that the empirical average errors given by Mukherjee are significantly optimistic in 1™ . We also 
propose a simple normalization scheme that improves the accuracy of his approximation substantially with respect 
to both average and maximum relative errors. 



1. Introduction 

The Minkowski (C p ) metric is inarguably one of the most commonly used quantitative distance (dissimi- 
larity) measures in scientific and engineering applications. The Minkowski distance between two vectors x = 
(xi, x 2 , • ■ • , x n ) and y = (j/i, 1/2, ■ ■ • , y n ) hi the n-dimensional Euclidean space, R™, is given by 

£ P (x, y ) = (J2 n l=1 \ x *-y*\ p ) 1/P ■ (!) 

Three special cases of the C p metric are of particular interest, namely, C± (city-block metric), £2 (Euclidean 
metric), and (chessboard metric). Given the general form (TTJ), C\ and £ 2 can be defined in a straightforward 
fashion, while is defined as 



£oo(x,y) = max \xi -y l \. 

l<i<n 

The Minkowski metric enjoys the property of being translation invariant, i.e., £ p (x, y) = £ p (x + z,y + z) for 
all x, y, z G W 1 . Since in many applications the data space is Euclidean, the most natural choice of metric is £2, 
which has the added advantage being isotropic (rotation invariant). For example, when the input vectors stem 
from an isotropic vector field, e.g., a velocity field, the most appropriate choice is to use the £2 metric so that all 
vectors are processed in the same way, regardless of their orientation [lj]. However, £ 2 has the drawback of a high 
computational cost due to the multiplication and square root operations. As a result, C\ and are often used as 
alternatives. Although these metrics are computationally more efficient, they deviate from £ 2 significantly. 

Due to the translation invariance of C p , it suffices to consider D p (x) = C p (x, 0), i.e., the distance from the point 
x to the origin. Therefore, in the rest of the paper, we will consider approximations to C p (x) rather than £ p (x, y). 
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Let D, denned on R™, be an approximation to D2 (Euclidean norm). We assume that D is a continuous and 
absolutely homogeneous function. Recall that D is called absolutely homogeneous (of degree one) if D(Xx) = 
|A|5(x) VA <G R, Vx <s R™. 

We note that all variants of D we consider in this paper satisfy these assumptions. As a measure of the quality 
of the approximation of D to D2 we define the maximum relative error (MRE) as 

-p l^-ftWI . (2) 

x£R™\{0} 

Using the homogeneity of D2 and £>, ([2]) can be written as 

e£ ax = sup |5(x)-l|, (3) 

where S 1 ^ 1 _1 = {x e R n : £>2(x) = 1} is the unit hypersphere of R™ with respect to the Euclidean norm. Furthermore, 
by the continuity of D, we can replace the supremum with maximum in and write 

e£ ax = max |5(x) - 1|. (4) 

We will use JP as the definition of MRE throughout. 

Mukherjee [2j recently introduced a class of distance functions called weighted t-cost distances that generalize 
m-ncighbor Q, octagonal Q, and i-cost Q distances. He proved that weighted i-cost distances form a family of 
metrics and derived an approximation for the Euclidean norm in Z n . Here we briefly review the t-cost norm. 

The i-cost norm Q defines two points in the rectangular grid as neighbors when their respective hypercubes 
(or hypervoxels) share a hyperplane of any dimension. The cost associated with these points can be at most t, 
1 < t < n, such that if two consecutive points on a shortest path share a hyperplane of dimension r, the distance 
between them is taken as min(i,?i — r). There are n distinct i-cost norms defined by 

t 

A(x)=^.t w , l<t<n 
i=i 

where is the i-th absolute largest component of x, i.e., (x(i), 21(2), • • • iX(n)) ls a permutation of \x%\, ■ ■ ■ , \x n \) 
such that xm > x^) > • • • > X(n)- The MRE of this norm is given by [5J 



£ max — max 



y/i- 1,1- 4= 



Mukherjee generalized the t-cost norm as follows 

D M (x) = max {w t D t (x)} , 

l<t<n 

where Wt's are non- negative real constants. Based on this weighted norm, he then derived an approximation for D2 
using the following weight assignment: Wt = l/^/i for 1 < t < n. Note that Dm consistently underestimates D2 
and the corresponding MRE is given by [2[ 

^max 1 ~ ; ^ ' (5) 

In a recent study , we examined various Euclidean norm approximations in detail and compared their average 
and maximum errors using numerical simulations. Here we show that two of those approximations, namely Barni 



et aVs norm P, 
Barni et al 



and Seol and Cheun's norm [8[, are viable alternatives to Dm- 
J 01 formulated a generic approximation for D2 as 



D B (x) = <5^a,5 



•(»)> 



where ct = (cki,Q!2, ••• , a n ) and (5 > are approximation parameters. Note that a non-increasing ordering and 
strict positivity of the component weights, i.e., ct\ > a 2 > ■ ■ ■ > a n > is a necessary and sufficient condition for 
Db to define a norm 0- 
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Barni et al. showed that the minimization of @ is equivalent to determining the weight vector a and the scale 
factor S that solve the following minimax problem: 

min max \Db (x) — 1 1 , 

a.S xEV 

where V = {x € M™ : x\ > x-x > • • • > x n > 0, ^(x) = 1}. The optimal solution and its MRE are given by 

a* = Vi-Vi=T, S* = 2 e°° x = l-6*. (6) 



Note the striking similarity between (|5|) and ([6]). Interestingly, a similar but less rigorous approach had been 
published earlier by Ohashi |9|. It should also be noted that several authors approached the problem from a 
Euclidean distance transform perspective and derived similar approximations for the 2- and 3-dimensional cases, 
see for example (Io| and 11 1. Furthermore, computation of weighted (Chamfer) distances in arbitrary dimensions 
on general point lattices is discussed in (l2l ]. 

More recently, Seol and Cheun 0] proposed an approximation of the form 

A»i(x) = aD 00 (x) + bD 1 (x), (7) 
where a and b are strictly positive parameters to be determined by solving the following 2x2 linear system 



aE(I&) +bE(D 00 D 1 )=E(D 2 D 00 ), 
oEpooDi) + 6E(Z>?) =E( j D 2j Di), 

where E(-) is the expectation operator. 

Seol and Cheun estimated the optimal values of a and b using 100, 000 n-dimensional vectors whose components 
are independent and identically distributed, standard Gaussian random variables. In Q, we demonstrated that a 
fixed number of samples from the unit hypersphere gives biased estimates for the MRE. The basic reason behind 
this is the fact that a fixed number of samples fail to suffice as the dimension of the space increases. 

It is easy to see that Db and D a: b fit into the general form 

n 

5(x) = ^WiXty, 

1=1 

which is a weighted Di norm. For Db the weights are W\ = S* and = 5* a*, whereas for D a b they are 

W\ — a + b and Wj^i = b. Clearly, Db has a more elaborate design in which each component is assigned a weight 
proportional to its ranking (absolute value). However, this weighting scheme also presents a drawback in that a full 
ordering of the component absolute values is required. 

Db and D a ,b can also be written as linear combinations of the D\ and Doo norms, as in ([7]). D\ overes- 
timates the Z?2 norm, whereas D^ underestimates it fl3j ]. Therefore, it is natural to expect a suitable linear 
combination of D\ and D^ to give an approximation to Di better than either of them [14J. Note that Rosen- 
feld and Pfaltz [l5| obtained a 2-dimensional approximation by combining D\ and D^ nonlinearly as follows: 
5(x) =max(L2 (D^x) + 1)/3J , Doo(x)). 

2. Comparison of the Euclidean Norm Approximations 

Due to their formulations, the MREs for D M and D B can be calculated analytically using ([5]) and ©, respec- 
tively. In Figure [1] we plot the theoretical errors for these norms for n < 100. It can be seen that Db is not only 
more accurate than Dm, but also it scales significantly better. 

The operation counts for each norm are given in Table [1] (ABS: absolute value, COMP: comparison, ADD: 
addition, MULT: multiplication, SQRT: square root). The following conclusions can be drawn: 

> Db and Dm have the highest computational cost due to the fact that they require sorting of the absolute 
values of the vector components. 

> D a b has the lowest computational cost among the approximate norms. A significant advantage of this norm 
is that it requires only two multiplications regardless of the value of n. 
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Figure 1: Maximum relative errors for Dm and Dg 



Table 1: Operation counts for the norms 



Norm 


ABS 


COMP 


ADD 


MULT 


SQRT 


Doo 


n 


n - 1 











Di 


n 





71-1 








D 2 








71- 1 


n 


1 


D B 


n 


0(nlogn) 


n - 1 


n 





D a ,b 


n 


71- 1 


71 


2 





D M 


n 


O(nlogn) 


71-1 


71 






> D a .b can be used to approximate D\ (squared Euclidean norm) using an extra multiplication. On the other 



hand, the computational cost of Db (Dm) 
sorting operations involved. 



is higher than that of D\ due to the extra absolute value and 



In Table [5] we display the percentage average and maximum errors for D a .b, Db, and Dm for n < 8. Average 
relative error (ARE) is defined as 

4 g = |^E|5(x)-l|, 

where S is a finite subset of the unit hypersphere 5*2 _1 , and \S\ denotes the number of elements in S. An efficient 
way to pick a random point on S% _1 is to generate n independent Gaussian random variables xi, x%, ■ ■ ■ , x n with 
zero mean and unit variance. The distribution of the unit vectors 



{y = (vi,V2, 



1/2 



1,2,. ...n} 



will then be uniform over the surface of the hypersphere [16j |. For each approximate norm, the ARE and MRE 
values were calculated over an increasing number of points, 2,2,... (that are uniformly distributed on the 
hypersphere) until the error values converge, i.e., the error values do not differ by more than e = 10 _J in two 
consecutive iterations. 

In Table [21 the error values under the column "Dm (R n )" were obtained using the aforementioned iterative 
scheme, whereas those under the column "Dm (Z n )" are taken from Motivated by the fact that Dm consistently 
underestimates L>2, we also experimented with a normalized form of this approximate norm given by Dj^(x) = 
D M (x)/5*. Note that 6* < 1 for n > 2 ©. 
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Tabic 2: Percentage average and maximum errors for the approximate Euclidean norms 





A 


i,b 




D B 




D 




D M 








M 


I 


I" 








n 


A DTTi 
AK-Cj 


Alt? TT 


A r? 


Aim,, 


A/\Tf T? 




IVlarje 


ARE 


MRE e 


ARE 


MREe 


MRE t 


2 


2.00 


5.25 


2.41 


3.96 


3.96 


2.48 


4.12 


2.55 


7.61 


2.40 


7.61 


7.61 


3 


2.39 


9.98 


3.00 


6.02 


6.02 


2.97 


6.40 


4.14 


11.35 


3.63 


11.35 


11.35 


4 


2.57 


13.64 


3.44 


7.39 


7.39 


3.28 


7.97 


5.21 


13.75 


4.29 


13.75 


13.75 


5 


2.68 


16.59 


3.77 


8.39 


8.39 


3.53 


9.16 


5.98 


15.47 


4.65 


15.46 


15.49 


6 


2.73 


18.88 


4.01 


9.19 


9.19 


3.73 


10.12 


6.55 


16.80 


4.85 


16.79 


16.83 


7 


2.76 


20.67 


4.18 


9.84 


9.84 


3.92 


10.91 


7.00 


17.90 


5.00 


17.86 


17.92 


8 


2.77 


21.92 


4.31 


10.39 


10.39 


4.10 


11.59 


7.35 


18.78 


5.04 


18.75 


18.82 



Note that for Dm and Db, two types of maximum error were considered: empirical maximum error (MRE e ), 
which is calculated numerically over S and the theoretical maximum error (MREt), which is calculated analytically 
using ((5|) and (|6]), respectively. 

By examining Tabled the following observations can be made regarding the maximum error: 

t> The most accurate approximation is Db- This is because this norm is designed to minimize the maximum 
error. 

t> The proposed normalization is quite effective since the resulting norm, D^j, is, on the average, only 8.6% less 
accurate than Db, whereas both Dm (K* 1 ) and Dm (Z n ) arc, on the average, about 85% less accurate than 
D B . 

> The least accurate approximations are Dm and D a ,6 for n < 4 and n > 4, respectively. 

> As n is increased, the error increases in all approximations. However, as can also be seen in Fig. [TJ the error 
grows faster in some approximations than others. 

> For Db, the empirical and theoretical errors agree almost perfectly in all cases, which demonstrates the 
validity of the presented iterative error calculation scheme. As for Dm, the agreement in each case is close, 
but not as close as that observed in Db- We have confirmed that using a smaller convergence threshold (e) 
alleviates this problem at the expense of increased computational cost. 

On the other hand, with respect to average error we can see that: 

> -D a , 6 is the most accurate approximation. This is because this norm is designed to minimize the average error. 

> Dm (R n ) and Dm (Z n ) are the least accurate approximations. Furthermore, the errors given by Mukherjee 
are lower than those that we obtained (over K"), and the discrepancy between the outcomes of the two error 
calculation schemes increases as n is increased. The optimistic average error values given by Mukherjee are 
due to the fact that his approximation was primarily intended for use in digital geometry and hence the 
calculations were performed in Z" (rather than l n ) using a very small number of points ranging from 32 to 
512 Q. In fact, Mukherjee used progressively fewer points with increasing n to calculate the error values. In 

we demonstrated that more points are required in higher dimensions to obtain unbiased error estimates. 

In the calculation of D-q-, we assumed that the optimal scaling factor for Dm is the same as that of Db, i.e., 5* . 
In order to check this assumption, we performed a one-dimensional grid search over [5* , 1] for each n value. The 
results are shown in Table |3l It can be seen that: 

> D|j is significantly more accurate than Di-^ with respect to both ARE and MRE. 

> Dj-~ and Db have almost identical MREs. Since Db is analytically optimized for the maximum error it can 
be concluded that D^-~ can reach the same optimality by means of a suitable scaling factor. 

> Interestingly, D^ is more accurate than Db with respect to ARE. This could be due to the fact that the two 
approximations take different paths towards minimizing the MRE. 
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Table 3: Percentage average and maximum errors for Dj-f 





M 




n 


ARE MRE e 5* 


ARE MRE e 5 


2 


2.48 4.12 0.960434 


2.41 3.96 0.961971 


3 


2.97 6.40 0.939809 


2.79 6.02 0.943192 


4 


3.28 7.97 0.926150 


2.99 7.39 0.931336 


5 


3.53 9.16 0.916059 


3.13 8.40 0.922654 


6 


3.73 10.12 0.908117 


3.23 9.18 0.915927 


7 


3.92 10.91 0.901603 


3.31 9.84 0.910619 


8 


4.10 11.59 0.896101 


3.40 10.39 0.905850 



3. Conclusions 

In this paper, we examined the weighted t-cost norm recently proposed by Mukherjee with respect to its 
ability to approximate the Euclidean norm in K ra . We evaluated the average and maximum errors of this norm using 
numerical simulations and compared the results to those of two other well-known Euclidean norm approximations. 
The results demonstrated that, because it was designed for digital geometry applications in Z™, the original weighted 
t-cost norm is not particularly suited to approximate the Euclidean norm in 1". It is also shown, however, that 
when normalized with an appropriate scaling factor, Mukherjee's norm becomes competitive with an analytically 
optimized approximation with respect to both average and maximum relative errors. 
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